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Abstract 

Solar UV emission has a profound impact on the upper terrestrial 
atmosphere. Because of instrumental constraints, however, solar prox- 
ies often need to be used as substitutes for the solar spectral variability. 
Finding proxies that properly reproduce specific spectral bands or lines 
is an ongoing problem. Using daily observations from 2003 to 2008 and a 
multiscale statistical approach, we test the performances of 9 proxies for 
the UV solar flux. Their relevance is evaluated at difi'erent time-scales 
and a novel representation allows all quantities to be compared simulta- 
neously. This representation reveals which proxies are most appropriate 
for different spectral bands and for different time scales. 
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1 Why are solar proxies needed ? 

The solar irradiance in wavelengths shortward of 300 nm is a key parameter 
for the specification of the upper terrestrial atmosphere [7]. Variations are 
observed on time-scales ranging from seconds to years and can impact radio 
wave propagation, satellite orbits through increased air-drag but also global 
Earth climate. Unfortunately, there has been no long-term and continuous 
measurement of the full solar UV spectrum until Feb. 2002, when the TIMED 
satellite started operating. Even today, the continuous measurement of solar 
irradiance with sufficient temporal resolution and radiometric accuracy remains 
a major instrumental challenge [25]. A important issue is the identification of 
proper substitutes (i.e. proxies) of the solar UV flux for upper atmospheric 
modehng [1-5]. 

Here, we test the performance of nine proxies for various UV spectral bands 
that encompass emissions coming from the solar corona down to the photo- 
sphere. The variability should therefore be strongly wavelength dependent, 
even though different solar layers are coupled. The solar UV radiation mostly 
affects the terrestrial atmosphere through photoionization and photochemistry. 



1 




X [nm] 

Figure 1: a) Average solar irradiance for the considered time interval; b) Vari- 
ability, defined here as f 00 x (maximum- minimum) /average irradiance over the 
period Aug. 2003 - Feb. 2008; c) Altitude of unit optical depth, an indicator 
of the altitude of maximum atmospheric absorption. The spectral bands refer 
to the naming conventions used in Sec. 2. 

which are again wavelength-dependent processes [7, 15]. The radiation in the 
MUV range (200-300 nm) mostly affects the stratospheric O3 concentration; 
the FUV range (122-200 nm) affects the upper mesospheric O2 excitation pro- 
duction and the lower thermospheric O2 dissociation; the EUV range (10-120 
nm) affects the thermospheric O, O2 and N2 ionization and excitation produc- 
tions. Other effects include the impact of the intense H I Lyman-a line at 
121.57 nm on nitric oxides, which are important for climatological considera- 
tions. The altitude of strongest absorption is shown in Fig. 1, together with the 
average spectral irradiance. No single proxy can reproduce the solar variability 
over the whole UV spectrum. Our prime objective therefore is to compare the 
measured irradiance in these bands to various proxies that are measured by 
independent means, partly from ground instruments. Several of these proxies 
have the advantage of not suffering from instrument degradation and so are 
easier to maintain in the long. 

Many authors have already evaluated the relevance of solar proxies for up- 
per atmosphere specification. Physical models of the solar irradiance (e.g. 
[8]) are not yet accurate enough below 400 nm, so most studies are based on 
experimental data, using statistical analyses or by comparing specific events 
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[17, 12, 6]. Some authors have tested upper atmospheric models with different 
solar inputs [21]. Various empirical models use such proxies to deliver solar UV 
spectra [24, 19]. 

Our approach is statistical as well. We use daily spectra from the TIMED 
and SORCE missions, which provide uninterrupted coverage of the full UV 
spectral range. In contrast to previous studies, however, we first decompose 
the different quantities into several time-scales before comparing them. Indeed, 
since different time-scales (e.g. solar rotation period and solar cycle) capture 
different physical processes, a multiscale approach is needed. The second nov- 
elty is a graphical representation, similar to the one used by [5], which, for the 
first time gives a global picture of how all proxies and spectral bands are related 
to each others, without having to go through the tedious visual comparison of 
scatter plots or correlation coefficients. 

2 The data and the method 

We consider daily measurements, covering the declining phase of the solar cy- 
cle, from from Aug. 2, 2003 until February 1, 2008. Flares are not included 
here because most instruments don't properly resolve them. The Solar flare 
[22] and FISM [2] models have been specifically designed to model the short 
term spectral variability during flares by incorporating high cadence soft X-ray 
measurements . 

Our spectral measurements are a composite of three different instruments. 
The 1-27 nm and the 121-122 nm ranges are covered by the XPS photometers 
onboard SORCE [20]. The data (version 9) in that range are not purely ob- 
servational since the CHIANTI model is used to compute the irradiance in 1 
nm bins. The 27-115 nm range is covered by the EGS spectrograph onboard 
TIMED [25], whose spectral resolution is 0.4 nm (version 9, the data are re- 
binned to 1 nm). The remaining wavelengths are covered by the SOLSTICE 
spectrometers onboard SORCE [20] with a resolution of 1 nm (version 16). 

In the following, we concentrate on 5 spectral bands that are considered to 
be important for aeronomy [15, 23], see Fig. 1. We shall call them XUV (0.5- 
10 nm), EUV (20-60 nm), Lyman-a (121-122 nm), FUV (130-170 nm) and 
MUV (220-270 nm). Note that these definitions differ from the ISO 21348:200 
standard that was used in Sec. 1. The proxies are: 

1. ISN, the international sunspot number (from SIDC, Brussels), which is 
not really a UV proxy but remains the most widely used gauge of solar 
activity. 

2. flO.7 is the radio flux at 10.7 cm (from Penticton Observatory, Canada). 
This proxy is widely used as a solar input to ionosphere/thermosphere 
models, partly because it can be conveniently measured from ground. 

3. Mgll is the corc-to-wing ratio of the Mg II line at 280 nm (from 
SORCE/SOLSTICE, version 9). This index probes the high chromo- 
sphere and is often advocated for the FUV [9]. 

4. CaK is the normalized intensity of the Ca II K-linc at 393 nm (from 
National Solar Observatory at Sacramento Peak). This line originates at 
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nearly the same altitude as the Mg II line and has also been advocated 
for the FUV [14]. 

5. MPSI is the magnetic plage strength index (from the Mt. Wilson 150- 
Foot Solar Tower) , which quantifies the relative fraction of the solar sur- 
face that is covered by mild magnetic fields (10 < |B| < 100 Gauss). By 
definition, the MPSI index is a proxy for plages and faculae [17]. 

6. MWSI is the Mount Wilson sunspot index, defined as the MPSI, but for 
intense magnetic fields (|B| > 100 Gauss). The MWSI is a proxy for 
active regions. 

7. slO.7 is computed by [23] out of the integrated 26-34 nm emission from 
the SEM radiometer onboard SoHO, and rescaled to the flO.7 index after 
a trend correction (version 3.9a). This proxy is dominated by the emission 
from the chromospheric and transition region He II line at 30.4 nm. 

8. Lyman-a channel (ch-L) is the expected output of a photodiode from the 
LYRA radiometer [10] onboard the PR0BA2 satellite, whose launch is 
scheduled for the end of 2009. This channel integrates emissions in the 
110-210 nm band, with a peak around 125 nm. We reconstructed the 
signal from SORCE/SOLSTICE data. 

9. Herzberg channel (ch-H) is the expected output of a photodiode from 
LYRA in the Herzberg band, here between 195-220 nm. Both the 
Herzberg and the Lyman-a channels arc relevant inputs for upper at- 
mospheric models. 

Note that the last three quantities are actually derived from UV irradiance 
measurements and so do not really qualify as proxies. Together with the Mgll 
index, they must be measured from space. 

To compare different time-scales, we first decompose each quantity into dif- 
ferent scales using the a trous wavelet decomposition [Ki]. The decomposition 
imposes scales that are centered on 3, 6, 12, 24, ... , 768 days. The components 
associated with scales of 24 and 96 days are illustrated in Fig. 2. The first 
scale captures solar rotation effects and the second one the long-term evolution 
of active regions. The most conspicuous result in Fig. 2 is the remarkable 
coherence of all quantities, which justifies a posteriori the widespread use of 
proxies for the solar UV flux. 

All quantities are affected by various types of noise. The signal-to-noise 
ratio of the irradiance degrades as the flux drops with decreasing solar cycle 
and detector degradation increases. Data gaps in the CaK (with 22% coverage 
and a median gap duration of 3 days), MPSI and MWSI (75 % coverage, 1 day 
median) indices were filled by multivariate interpolation [13]. Gap filling affects 
the variability on the shortest time-scales (up to about 6 days) by enhancing 
the correlation with other proxies. The smallest scales require more careful 
analysis and are therefore discarded here. 

3 The proxy-to-spectral band relation 

Since all quantities are strongly correlated, we use the Pearson correlation 
coefficient r^y = Sxy/{sxSy) (where Sxy is the sample covariance, and is 
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Figure 2: Decomposition of the 5 spectral bands and the 9 proxies into two 
different scales: solar rotation period (top) and an average time-scale of 96 
days (bottom). Amplitudes are arbitrary. The legends are given in the text. 
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the sample variance) as a gauge of linear correspondence between x and y. 
Unfortunately, this quantity can only be computed pairwise, leaving us with a 
large table of values to decipher. Better insight can be gained by turning these 
values into a graph, using the multidimensional scaling technique [1]. The idea 
consists in representing all irradiances and proxies on a 2D correspondence map 
in such a way that their mutual distance equals their dissimilarity, defined here 
as d = 1 — |r|. Such a map allows the proxy-to-spectral band relation to be 
investigated in detail. Two close variants of this approach have already been 
used by [5, 4]. 

In principle, we would need a 13-dimensional plot to represent all quantities 
while accounting for their mutual distances exactly [1]. However, because of 
the strong coherency of the spectral variability, a 2D (and sometimes even a 
ID) approximation already captures the salient statistical features. We verified 
that a third dimension does not bring new insight here. 

Figure 3 is the central result of this study, as it shows the correspondence 
between proxies and spectral bands for three typical scales. What matters in 
these plots is the distance between each pair, and not the absolute positions 
or the axes, which do not carry any direct meaning. The closer two quantities 
are, the more correlated they are. The correspondence map therefore provides 
both quantitative (the distance is related to r) and qualitative information 
on how all quantities are related to each other. The top panel covers time- 
scales of typically half a solar rotation, the middle panel one solar rotation and 
the bottom panel several rotations. It comes as no surprise that quantities 
with strong physical connections, such as the Lyman-a band and the Lyman-a 
channel, always stick together. 

Interestingly, all spectral bands are roughly distributed along a line, reveal- 
ing a gradual transition from optically thin and energetic emissions on the left 
to optically thick ones on the right. For time-scales below one solar rotation 
(upper plot), this ordering manifests itself in the ccnter-to-limb variation, with 
brightenings in the XUV and EUV bands, and darkenings in the FUV and 
MUV bands [3]. Our maps show that the same gradual transition persists for 
longer time-scales. The relative location of proxies with respect to this align- 
ment changes substantially with time-scale, which confirms and highlights the 
importance of a multiscale approach. 

The bottom panel focuses on time-scales that encompass the decay time of 
active regions. Similar distributions are observed for larger scales but the solar 
cycle time scale could not be addressed by lack of data. Some correlations are 
difficult to explain, such as the close location of the Mgll index and the EUV 
band, already noted by [11]. Equally surprising is the difference between the 
Mgll and CaK indices, which probe nearly the same solar altitude. 

4 Conclusions and Recommendations 

What are the best proxies for reconstructing specific spectral bands ? Figure 
3 precisely tells us that the answer very much depends on the time-scale of 
interest. Several general conclusions can nevertheless be drawn. 

1. Proxies that arc derived from real irradiancc data do indeed match their 
corresponding spectral band quite well. The correlation between the 
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Figure 3: Correspondence maps for three characteristic scales. The distance 
between each pair of points corresponds to 1 — |r| (see text) and the hne thick- 
ness is proportional to r. Capitals designate our 5 spectral bands: (X)UV, 
(E)UV, H I (L)yman-Q!, (F)UV and (M)UV. The other characters correspond 
to proxies: (i)sn, (f)10.7, (s)10.7, M(g)II, (c)aK, M(p)SI, M(w)SI, L(y)man-a 
channel, (h)erzberg channel. 
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LYRA Lyman-a channel and the Lyman-a band, for example, always 
exceeds 0.95. 

2. The Mgll index shows the best global performance since it is always 
located close to the center of the cloud of points. It is particularly well 
suited for the FUV band. The MPSI index is a backup solution, but is 
not measured continuously. 

3. Apart from these, however, no single spectral band can be properly re- 
constructed at all scales from one single proxy. 

4. There is no good (non irradiance-derived) proxy for the XUV and EUV 
bands. The flO.7 index would be the least bad solution. 

5. None of our proxies properly fits the MUV band, for which a better gauge 
of photospheric emissions is needed. 

The correspondence map can also be used to search for new proxies. In- 
deed, when three quantities are closely spaced and aligned, then the one in the 
middle be approximated by taking a linear combination of the two others. Two 
spectral bands whose reconstruction can be improved that way are the FUV 
and the Lyman-a bands. Reciprocally, remotely located quantities are hardest 
to reconstruct. This is particularly evident for the MUV and the XUV bands. 
For the same reason, the sunspot number and the MWSI should not be used 
for any reconstruction as they capture variations that are not reproduced by 
the irradiance. 

We are currently developing this multiscale approach for more accurate 
reconstructions of the previously considered UV bands. The EUV band, for 
example, can be reconstructed as a sum of contributions from different proxies, 
decomposed into different scales and with different weights. Our tests show 
that the residual error between the observed and modeled flux can be reduced 
that way by a factor of 2 to 4 as compared to models that do not consider 
multiple scales. These results will be detailed in a forthcoming publication. A 
second improvement consists in incorporating the effect of a convolution with 
time, as studied by [IN]. An aspect that matters for operational models is 
the availability of the proxies, since their long-term measurement is not always 
guaranteed. 
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